De Finetti's Theorem

In probability theory, de Finetti's theorem states that exchangeable observations are conditionally independent relative to some latent variable. An epistemic probability distribution could then be assigned to this variable. It is named in honor of Bruno de Finetti.

For the special case of an exchangeable sequence of Bernoulli random variables it states that such a sequence is a "mixture" of sequences of independent and identically distributed (i.i.d.) Bernoulli random variables.

A sequence of random variables is called exchangeable if the joint distribution of the sequence is unchanged by any permutation of the indices. While the variables of the exchangeable sequence need not be ''themselves'' independent, only exchangeable, there is an ''underlying'' family of i.i.d. random variables. That is, there are underlying, generally unobservable, quantities that are i.i.d.; exchangeable sequences are mixtures of i.i.d. sequences.


Background

A Bayesian statistician often seeks the conditional probability distribution of a random quantity given the data. The concept of exchangeability was introduced by de Finetti. De Finetti's theorem explains a mathematical relationship between independence and exchangeability.

An infinite sequence
:X_1, X_2, X_3, \dots
of random variables is said to be exchangeable if for any natural number ''n'', any finite sequence of indices i_1, ..., i_n, and any permutation π: {i_1, ..., i_n} → {i_1, ..., i_n} of that sequence, the two sequences
:(X_{i_1}, \dots, X_{i_n}) \text{ and } (X_{\pi(i_1)}, \dots, X_{\pi(i_n)})
both have the same joint probability distribution.

If an identically distributed sequence is independent, then the sequence is exchangeable; however, the converse is false: there exist exchangeable random variables that are not statistically independent, for example the Pólya urn model.
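The Pólya urn claim can be checked with a small exact calculation. The sketch below assumes the simplest urn, starting with one red and one black ball and adding one extra ball of the drawn colour after each draw; the function name `polya_sequence_prob` and the encoding (1 = red, 0 = black) are illustrative choices, not part of the original text.

```python
from fractions import Fraction
from itertools import permutations


def polya_sequence_prob(seq, red=1, black=1):
    """Exact probability of a 0/1 draw sequence from a Polya urn.

    1 means a red ball was drawn, 0 a black ball. After each draw the
    ball is returned together with one extra ball of the same colour.
    The starting counts (1 red, 1 black) are an assumption of this sketch.
    """
    prob = Fraction(1)
    r, b = red, black
    for x in seq:
        if x == 1:
            prob *= Fraction(r, r + b)
            r += 1
        else:
            prob *= Fraction(b, r + b)
            b += 1
    return prob


# Exchangeability: every reordering of a sequence has the same probability.
orderings = {polya_sequence_prob(p) for p in permutations((1, 1, 0))}
assert len(orderings) == 1

# Dependence: P(X_1 = 1, X_2 = 1) differs from P(X_1 = 1) * P(X_2 = 1).
p_x1 = sum(polya_sequence_prob((1, x2)) for x2 in (0, 1))
p_x2 = sum(polya_sequence_prob((x1, 1)) for x1 in (0, 1))
p_both = polya_sequence_prob((1, 1))
print(p_x1, p_x2, p_both)  # prints: 1/2 1/2 1/3
```

Here the joint probability 1/3 exceeds the product 1/4 of the marginals, so the draws are exchangeable but not independent.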


Statement of the theorem

A random variable ''X'' has a Bernoulli distribution if Pr(''X'' = 1) = ''p'' and Pr(''X'' = 0) = 1 − ''p'' for some ''p'' ∈ (0, 1).

De Finetti's theorem states that the probability distribution of any infinite exchangeable sequence of Bernoulli random variables is a "mixture" of the probability distributions of independent and identically distributed sequences of Bernoulli random variables. "Mixture", in this sense, means a weighted average, but this need not mean a finite or countably infinite (i.e., discrete) weighted average: it can be an integral rather than a sum.

More precisely, suppose X_1, X_2, X_3, ... is an infinite exchangeable sequence of Bernoulli-distributed random variables. Then there is some probability distribution ''m'' on the interval [0, 1] and some random variable ''Y'' such that
* The probability distribution of ''Y'' is ''m'', and
* The conditional probability distribution of the whole sequence X_1, X_2, X_3, ... given the value of ''Y'' is described by saying that
** X_1, X_2, X_3, ... are conditionally independent given ''Y'', and
** For any ''i'' ∈ {1, 2, 3, ...}, the conditional probability that X_i = 1, given the value of ''Y'', is ''Y''.
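The two conditions above can be combined into a single integral formula for the finite-dimensional distributions. The following is a sketch in LaTeX; the symbols x_1, ..., x_n ∈ {0, 1} and s (the number of ones among them) are notation introduced here for illustration only.

```latex
\Pr(X_1 = x_1, \dots, X_n = x_n)
  = \int_0^1 p^{s} (1 - p)^{n - s} \, dm(p),
\qquad s = \sum_{i=1}^{n} x_i .
```

The integrand is the probability of the outcomes x_1, ..., x_n under i.i.d. Bernoulli(''p'') variables, and ''m'' supplies the weights of the "weighted average". Since the right-hand side depends on the outcomes only through ''s'', reordering them does not change the probability.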


Another way of stating the theorem

Suppose X_1, X_2, X_3, \ldots is an infinite exchangeable sequence of Bernoulli random variables. Then X_1, X_2, X_3, \ldots are conditionally independent and identically distributed given the exchangeable sigma-algebra (i.e., the sigma-algebra consisting of the events that are measurable with respect to X_1, X_2, \ldots and invariant under finite permutations of the indices).


Example

Here is a concrete example. We construct a sequence
:X_1, X_2, X_3, \dots
of random variables, by "mixing" two i.i.d. sequences as follows.

We assume ''p'' = 2/3 with probability 1/2 and ''p'' = 9/10 with probability 1/2. Given the event ''p'' = 2/3, the conditional distribution of the sequence is that the X_i are independent and identically distributed, with X_1 = 1 with probability 2/3 and X_1 = 0 with probability 1 − 2/3. Given the event ''p'' = 9/10, the conditional distribution of the sequence is that the X_i are independent and identically distributed, with X_1 = 1 with probability 9/10 and X_1 = 0 with probability 1 − 9/10.

This can be interpreted as follows: Make two biased coins, one showing "heads" with 2/3 probability and one showing "heads" with 9/10 probability. Flip a fair coin once to decide which biased coin to use for all flips that are recorded. Here "heads" at flip ''i'' means X_i = 1.

The independence asserted here is ''conditional'' independence, i.e. the Bernoulli random variables in the sequence are conditionally independent given the event that ''p'' = 2/3, and are conditionally independent given the event that ''p'' = 9/10. But they are not unconditionally independent; they are positively correlated.

In view of the strong law of large numbers, we can say that
:\lim_{n\to\infty} \frac{X_1 + \cdots + X_n}{n} = \begin{cases} 2/3 & \text{with probability } 1/2, \\ 9/10 & \text{with probability } 1/2. \end{cases}

Rather than concentrating probability 1/2 at each of two points between 0 and 1, the "mixing distribution" can be any probability distribution supported on the interval from 0 to 1; which one it is depends on the joint distribution of the infinite sequence of Bernoulli random variables.

The definition of exchangeability, and the statement of the theorem, also make sense for finite-length sequences
:X_1, \dots, X_n,
but the theorem is not generally true in that case. It is true if the sequence can be extended to an exchangeable sequence that is infinitely long. The simplest example of an exchangeable sequence of Bernoulli random variables that cannot be so extended is the one in which X_1 = 1 − X_2 and X_1 is either 0 or 1, each with probability 1/2. This sequence is exchangeable, but cannot be extended to an exchangeable sequence of length 3, let alone an infinitely long one.
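The two-coin mixture from this example can be verified with exact arithmetic. The sketch below is illustrative only: the names `MIXTURE` and `sequence_prob` are hypothetical, and the code simply evaluates the weighted average of the two i.i.d. Bernoulli laws described in the text.

```python
from fractions import Fraction
from itertools import product

# Mixing distribution from the example: p = 2/3 or p = 9/10, each with weight 1/2.
MIXTURE = {Fraction(2, 3): Fraction(1, 2), Fraction(9, 10): Fraction(1, 2)}


def sequence_prob(seq, mixture=MIXTURE):
    """P(X_1 = seq[0], ..., X_n = seq[-1]) under the mixture.

    For each candidate p the flips are i.i.d. Bernoulli(p); the result is
    the average of those i.i.d. probabilities, weighted by the mixture.
    """
    ones = sum(seq)
    zeros = len(seq) - ones
    return sum(w * p**ones * (1 - p)**zeros for p, w in mixture.items())


# Sanity check: the probabilities of all length-3 outcomes sum to 1.
assert sum(sequence_prob(s) for s in product((0, 1), repeat=3)) == 1

# Exchangeability: the probability depends only on the number of ones.
assert sequence_prob((1, 0, 1)) == sequence_prob((0, 1, 1)) == sequence_prob((1, 1, 0))

# Unconditionally, two flips are positively correlated.
mean = sequence_prob((1,))       # E[X_1]
both = sequence_prob((1, 1))     # E[X_1 X_2]
covariance = both - mean * mean
print(mean, both, covariance)    # prints: 47/60 1129/1800 49/3600
```

The covariance 49/3600 equals the variance of ''p'' under the mixing distribution, (1/4)(9/10 − 2/3)^2, which is why any non-degenerate mixture of this kind produces positively correlated flips.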


Extensions

Versions of de Finetti's theorem for ''finite'' exchangeable sequences, and for ''Markov exchangeable'' sequences, have been proved by Diaconis and Freedman and by Kerns and Szekely. Two notions of partial exchangeability of arrays, known as ''separate'' and ''joint exchangeability'', lead to extensions of de Finetti's theorem for arrays by Aldous and Hoover.

The computable de Finetti theorem shows that if an exchangeable sequence of real random variables is given by a computer program, then a program which samples from the mixing measure can be automatically recovered.

In the setting of free probability, there is a noncommutative extension of de Finetti's theorem which characterizes noncommutative sequences invariant under quantum permutations.

Extensions of de Finetti's theorem to quantum states have been found to be useful in quantum information, in topics like quantum key distribution and entanglement detection.


See also

* Choquet theory
* Hewitt–Savage zero–one law
* Krein–Milman theorem


External links

* Accardi, L., "De Finetti theorem", ''Encyclopedia of Mathematics'' (Springer)
* What is so cool about De Finetti's representation theorem?